Rank in Wordlist | Frequency | Word |
---|---|---|
3043 | 53538 | 1,000 |
3498 | 45787 | it,” |
4183 | 37368 | 10,000 |
4776 | 32002 | 100,000 |
4961 | 30581 | 2,000 |
5558 | 26540 | 5,000 |
6146 | 23297 | 30,000 |
6181 | 23162 | that,” |
6260 | 22747 | 50,000 |
6633 | 21210 | 20,000 |
Rank in Wordlist | Frequency | Word |
---|---|---|
3429423 | 1 | 10%(9.73% |
Rank in Wordlist | Frequency | Word |
---|---|---|
397548 | 45 | .) |
995697 | 11 | Sunn O) |
995698 | 11 | Sunn O)) |
995699 | 11 | Sunn O))) |
1884480 | 4 | New York Knicks) |
2470482 | 2 | .--) |
3471812 | 1 | 14%)/+16% |
3474127 | 1 | 14.9%)8.6% |
3502819 | 1 | 18%)/+14% |
3739400 | 1 | 6%)/+3% |
Rank in Wordlist | Frequency | Word |
---|---|---|
4527 | 34016 | 10% |
4835 | 31527 | 50% |
4873 | 31167 | 100% |
4890 | 31081 | 20% |
5669 | 25853 | 5% |
5922 | 24419 | 30% |
6367 | 22333 | 40% |
6943 | 19976 | 2% |
7356 | 18589 | 25% |
7604 | 17746 | 15% |
Rank in Wordlist | Frequency | Word |
---|---|---|
3802 | 41573 | S&P |
12452 | 9178 | R&D |
13437 | 8300 | Q&A |
15222 | 6929 | AT&T |
16107 | 6402 | M&A |
16229 | 6334 | A&E |
17815 | 5589 | A&M |
18188 | 5423 | R&B |
19594 | 4859 | M&S |
25234 | 3371 | SG&A |
Rank in Wordlist | Frequency | Word |
---|---|---|
54990 | 1048 | A$AP |
70439 | 711 | A$AP Rocky |
71661 | 692 | US$1 |
94588 | 446 | US$100 |
108538 | 359 | US$2 |
110213 | 350 | US$10 |
113496 | 334 | US$3 |
123174 | 293 | US$5 |
133547 | 257 | US$20 |
140360 | 237 | US$200 |
Rank in Wordlist | Frequency | Word |
---|---|---|
112 | 1079277 | ." |
Rank in Wordlist | Frequency | Word |
---|---|---|
283 | 475293 | it's |
442 | 333743 | It's |
597 | 261865 | don't |
701 | 227869 | .' |
721 | 223468 | I'm |
1004 | 167974 | that's |
1097 | 155544 | didn't |
1254 | 135740 | you're |
1268 | 134209 | we're |
1398 | 121798 | doesn't |
Rank in Wordlist | Frequency | Word |
---|---|---|
18227 | 5406 | Apple TV+ |
43919 | 1481 | Disney+ series |
79099 | 592 | Disney+ Hotstar |
80313 | 578 | APNU+AFC |
84705 | 534 | live+same |
100230 | 407 | Travel + Leisure |
101590 | 398 | Disney+'s |
111383 | 344 | TV+’s |
123382 | 292 | New Game+ |
133533 | 257 | TV+'s |
Rank in Wordlist | Frequency | Word |
---|---|---|
119481 | 307 | Grade II* |
162764 | 187 | Grade II* listed |
330560 | 61 | Sagittarius A* |
355783 | 54 | Grade II* listed building |
439248 | 39 | Sgr A* |
570564 | 26 | The S*n |
775817 | 16 | NOC * NSF |
975038 | 11 | Grade 2* |
997187 | 11 | The Subtle Art of Not Giving a F*ck |
1416189 | 6 | F**k You |
Rank in Wordlist | Frequency | Word |
---|---|---|
4061 | 38529 | P/E |
6000 | 23989 | and/or |
7158 | 19216 | P/E/G |
7261 | 18872 | https://www |
9180 | 13703 | https://t |
11894 | 9779 | 24/7 |
15558 | 6733 | 2023/24 |
17179 | 5852 | 9/11 |
17312 | 5795 | 2022/23 |
17823 | 5586 | 1/2 |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots